Song-Chun Zhu

University of California, Los Angeles

Iterative Tool Usage Exploration for Multimodal Agents via Step-wise Preference Tuning

May 06, 2025

MetaScenes: Towards Automated Replica Creation for Real-world 3D Scans

May 05, 2025

Iterative Trajectory Exploration for Multimodal Agents

Apr 30, 2025

TongUI: Building Generalized GUI Agents by Learning from Multimodal Web Tutorials

Apr 17, 2025

Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis

Apr 01, 2025

Decompositional Neural Scene Reconstruction with Generative Diffusion Prior

Mar 19, 2025

Differentiable Information Enhanced Model-Based Reinforcement Learning

Mar 03, 2025

Building Interactable Replicas of Complex Articulated Objects via Gaussian Splatting

Feb 26, 2025

Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage

Dec 20, 2024

Embedding high-resolution touch across robotic hands enables adaptive human-like grasping

Dec 19, 2024